Enriching a Treebank to Investigate Relative Clause Extraposition in German

نویسنده

  • Jan Strunk
چکیده

I describe the construction of a corpus for research on relative clause extraposition in German based on the treebank TüBa-D/Z. I also define an annotation scheme for the relations between relative clauses and their antecedents which is added as a second annotation level to the syntactic trees. This additional annotation level allows for a direct representation of the relevant parts of the relative construction and also serves as a locus for the annotation of additional features which are partly automatically derived from the underlying treebank and partly added manually. Finally, I also report on the results of two pilot studies using this enriched treebank. The first study tests claims made in the theoretical literature on relative clause extraposition with regard to syntactic locality, definiteness, and restrictiveness. It shows that although the theoretical claims often go in the right direction, they go too far by positing categorical constraints that are not supported by the corpus data and thus underestimate the complexity of the data. The second pilot study goes one step in the direction of taking this complexity into account by demonstrating the potential of the enriched treebank for building a multivariate model of relative clause extraposition as a syntactic alternation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Self Embedded Relative Clauses in a Corpus of German Newspaper Texts

The distribution of center self-embeddings and extrapositions in German is assumed to reflect a universal performance strategy of minimizing memory load during parsing. Self-embedded relative clauses of embedding depth 2 were semi-automatically analysed in a treebank of German newspaper texts. Clause length and especially extraposition distance are found as the main distinctive parameters betwe...

متن کامل

Relative Clause Extraposition in German: an efficient and portable implementation

In this paper, I propose an implementation of relative clause extraposition in German. The proposal builds on Kiss (in press) who treats relative clause extraposition as an anaphoric process by means of percolation of anchors to which the relative clause is bound. I discuss several sources of spurious ambiguity in Kiss’s original formulation and suggest a two-step percolation of anchors that cr...

متن کامل

Amalgam: A machine-learned generation module

Amalgam is a novel system for sentence realization during natural language generation. Amalgam takes as input a logical form graph, which it transforms through a series of modules involving machine-learned and knowledge-engineered sub-modules into a syntactic representation from which an output sentence is read. Amalgam constrains the search for a fluent sentence realization by following a ling...

متن کامل

The Information Structure of Subject Extraposition in Early New High German

This paper investigates the information-structural characteristics of extraposed subjects in Early New High German (ENHG). Based on new quantitative data from a parsed corpus of ENHG, I will argue that unlike objects, subjects in ENHG have two motivations for extraposing. First, subjects may extrapose in order to receive narrow focus, which is the pattern Bies (1996) has shown for object extrap...

متن کامل

Studien zur performanzorientierten Linguistik Aspekte der Relativsatzextraposition im Deutschen

Looking at relative clause extraposition in German as a concrete example, the paper demonstrates how linguistic model building, corpus study and psycholinguistic experiments combine into an integrational research programme that aims at an improved understanding and linguistically as well as cognitively adequate modelling of human language performance. Starting from the word order theory articul...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010